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Abstract. We study epidemic threshold properties in a scale-free random 
graph model. We show via a branching process approximation that the di- 
vergence of the second moment of the degree distribution is equivalent to the 
absence of an epidemic threshold. We study further the relation between di- 
ameter and epidemic threshold. Absence of an epidemic threshold happens 
precisely when a positive fraction of the nodes form a cluster of bounded di- 
ameter. 



1. Introduction 

In the last decade there was an enormous increase of interest in phenomena 
which show for characteristic observables a power-law behavior. Power-law distri- 
butions are often called scale-free distributions due to the absence of characteristic 
sizes. Most prominent under the huge variety of examples became systems which 
show so-called selforganized criticality (SOC), |3] 0|. The basic idea to explain 
the appearance of a power-law-distribution is via the linkage to phase-transitions 
where the appearance of scale-free structures is known since a long time. Despite 
many efforts only little is known about the origin of power-law-distributions on a 
mathematical base. Partially due to the increasing knowledge of the structure of 
the internet there is a growing number of articles studying the structure of networks 
with a power law distribution for the degree (see pQ and the references therein). 

In this article we want to discuss a specific model of scale-free-graphs based on 
a partner choice strategy and its epidemic threshold properties. Related questions 
found recently much interest due to the possible absence of epidemic thresholds 
for a certain range of power law exponents [S] UJ. We were mainly motivated to 
our studies by the paper of Pastor-Satorras and Vespignani [S], where on largely 
heuristic grounds the absence of an epidemic threshold for power-law-graphs with 
exponents less than 3 was claimed. Since the argumentation in this paper was quite 
doubtful from a mathematical point of view but somehow intuitively convincing 
we wanted to study the same question on more solid grounds. To get enough 
theoretical control we developed a new model of power-law-graphs which seems 
to be of independent interest. Due to the strong independence properties - as far 
as this is possible for a scale-free degree-distribution - we were able to carry out a 
rigorous analysis of certain aspects of the model. For a certain range of parameters 
we get a power-law distribution for the total degree. For other parameter values 
there is an interesting domain where one has a fat tail degree distribution with 
gaps. Here the integrated tail distribution is still of power-law form. In both 
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cases the divergence of the second moment of the degree distribution is indeed 
equivalent to the absence of an epidemic threshold. We furthermore examine the 
relation between divergence of the second moment of the degree distribution and 
diameter-like properties of the relevant graph-spaces. Namely one has in the case 
of absence of an asymptotic epidemic threshold the striking property that there is 
a huge (meaning a positive fraction of the whole population) cluster of finite- size 
independent- diameter. This clusters carries the epidemics and forms a kind of very 
small world inside the small world of the largest connected component which can 
still be of logarithmic diameter. 

We further study some models with completely different degree distribution 
where the absence of an epidemic threshold is not necessary linked to the divergence 
of the second moment of the degree-distribution. 

2. A PREFERENTIAL-CHOICE MODEL OF SCALE-FREE GRAPHS 

In the following we want to describe the basic structure of the model we consider 
in this paper. Our first aim is to construct a family of static random graph models 
in which vertex degrees are distributed power-law like, while edges still have high 
degree of independence. As usual in random graph theory we will entirely deal with 
asymptotic properties in the sense that the graph size goes to infinity. 

We consider graphs with vertex set V = V n = {1, n} where an edge between 
the vertices x and y (denoted by x ~ y) is interpreted as a persistent contact 
between the two nodes. Given x e V, its degree is denoted by d(x). We will 
think of edges as generated by a pair-formation process in which each vertex x - 
often denoted as an individual - chooses a set of partners according to a specified 
x-dependent rule. Therefore the set of individuals which have contact with a given 
vertex x can be divided into two -possible non-disjoint sets: the set of nodes which 
are chosen by x himself and the set of nodes which have chosen x as one of their 
partners. We call the size of the first set the outdegree d out (a:)of x and the size 
of the second one the indegree di n (x) of x. Obviously d(x) < d out {x) + di n (x) and 
if the choices are sufficiently independent one can expect equality to hold almost 
surely if n — > oo. 

We partition the set of vertices V n into groups {Ci(n)} i>l where all members 
of a group Ci(n) choose exactly i partners by themselves (d ou t = i on d(n)). Let 
Pa ( n > j) the probability for x to choose a fixed partner y e Cj (n) if n partners are 
available for the choice and just one choice will be made be 

(2-1) P*(n,j) = A a (n)£ 

Here A a (n) is a normalization constant such that A a (n) ^X)i>i |Ci( n )lir) ~ ^ an< ^ 

a is a real parameter. Since we want A a (n) — ► A Q ,as n — > oo we need |C»(n)| — 
to be bounded as a function of n which will impose later on constraints on the 
constant a. The parameter a acts as an affinity parameter tuning the tendency to 
choose a partner with a high outdegree or low outdegree. If a = 0, choices are made 
without any preferences and A a {n) = 1. For a > the "highly active" individuals 
are preferred whereas for a < the "low activity" individuals are favored. From 
this we obtain the basic probability 

i ■ j a 

Pr[x from d (n) has chosen y 6 Cj (n) as one of his partners] ~ A a (n) 
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Concerning the size of the sets Cj(n) we will make the following assumption: 

(2.2) ^ =:*(») -> * 

With this choice we have to impose the restriction a < 7 — 1 to ensure the 
convergence of A a (n). Wc require furthermore 7 > 2 throughout the paper since 
otherwise the expected indegree for individuals from a fixed group would diverge. 
The basic probabilities together with the fixed outdegree-distribution define a prob- 
ability distribution on each graph with vertex-set V n , and therefore a random graph 
space Q n (a, 7). 

We want to investigate the threshold properties of epidemic processes on these 
random graphs as a function of the parameters 7 and a. Here we are mainly 
interested in the question whether the epidemics is asymptotically (n — > 00) sub-or 
overcritical. For the threshold study in this random graph model it is enough to 
investigate the epidemic process as a multi-group branching process since typical 
graphs have locally a tree structure. Absence of a threshold means here that for 
arbitrary small transition probability fi > one still gets for n — > 00 an overcritical 
epidemic process. In section 3 we will compare the results for the above model with 
some specific graph models which have no power-law-distribution but still diverging 
second moments of the degree distribution. 

First we want to compute the important pairing probabilities which will be used 
frequently. We start with the easier case a = 0. 

(2.3) p v{x ~y\ xeCi ;y€C k ) = 1 ^- 1 ^ — *±* 

n n n— >oo n 

Completely similar one can compute the corresponding probabilities for a ^ 0. 
Dropping the simple details we just state the result: 

(2.4) Pr(x~y\xea ; y £ C k ) ~ Aa ^— + ^ as n -> 00. 

n 

It turns out, that the typical graphs in this model still have for a < 2 a power- 
law distribution for the degree with an exponent which can be different from the 
exponent of the outdegree. For a > 2 we obtain a degree distribution which follows 
in mean a power law but has gaps. To compare both domains we will use the inte- 
grated tail distribution F k := Pr (d (x) > k) . We will show that in both cases we get 
the same integrated tail distribution. Since we are interested in the dependence of 
the epidemic threshold from the power-law exponent of the total degree distribution 
we have to analyze how this exponent varies with the two parameters a and 7. Since 
the partner choice is sufficiently random and not too strongly biased toward high 
degree individuals (that's the meaning of the condition a < 7 — 1) it is easy to see 
that the indegree distribution of a vertex from group Cj converges for n — > 00 to a 
Poisson distribution with mean const ■ i a . There are essentially two regimes in the 
parameter space, one for which the expected indegree is of smaller order than the 
outdegree over all groups and one where the indegree is asymptotically of larger 
order. In the first case it is clear that the indegree is too small to have an effect on 
the degree distribution exponent. In other words: the set of individuals with degree 
k consists mainly of individuals whose outdegree is of order k. An easy estimation 
using the formula for the pairing probabilities shows that the expected indegree of 
individuals from group % is given by E (c£j„ (x) \ x E Cj) ~ const ■ i a asymptotically. 
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Therefore the indegree is of smaller order than the outdegree if a < 1 . In the case 
7 — 1 > a > 1 the set of individuals with degree k consists mainly of individuals 
from groups with an index of order k~ . Furthermore we will show that the con- 
dition for absence of an epidemic threshold is precisely given by the divergence of 
the second moment of the degree distribution. 

We will state now the main result of the paper which will be shown in the next 
sections. 



2.0.1. Main Results: a) for a < 1 the random graph G n (ot, 7) has a power-law 



distribution for the degree with exponent 7 

b) for 1 < a < min (2; 7 — 1) the random graph Q n (a, 7) has a power-law distri- 
bution for the degree with exponent 7' = 1 + — — — 

c) for 2 < a < 7 — 1 the random graph Q n (a, 7) has a power-law integrated tail 
distribution with exponent ^ — ^ 

d) there is an epidemic threshold for the parameter set {a < -^,7 > 3} and 
absence of an asymptotic threshold on the complement 

e) for a > 2^-!- or 7 < 3 the second moment of the degree distribution diverges. 



2.1. The degree distribution and second moment estimation. In this para- 
graph we want to estimate the degree distribution, respectively the integrated tail 
distribution for the parameter domains 1 < a < 2 and 2 < a < 7 — 1 and give 
conditions for the divergence of the second moments. We first note that for a > 1 
the indegree dominates the outdegree in every group C{. Furthermore for n — > 00 
and any groupindex i the indegree distribution converges to a Poisson distribution 
with expectation A (i) = const ■ i a . Since the variance is of order i a it is clear 
that for a > 2 and large i there will be essential no overlap between the indegree 
distribution in group i and group i + 1. 

We first deal with the indegree distribution for a < min (2; 7 — 1) : 



(2.5) Pr {d in (x) - k) ~ ^Pr(d m W = fc|seC,; 

i 

(2.6) = Y ^t^le-oonst^ . £1 

y J ^ k\ it 

i<k 



£l 



. The essential interval of indices i which contribute for sufficiently large k to the 
sum is given by the condition k G const ■ i a — a (k) i~ ; const ■ i a + a (k) i 
where a (k) is a slowly growing function in k (for instance log A; is fine). Solving 
for i and denoting the above constant by A the we get for the boundaries of the 
essential index interval I ess (fc) :~ [i m j n (fc) ;« max (fc)] : 

, . 2. 

(2.7) Wfc) : =,-^ + ^ + * 

(2.8) w(fc) : =1^+(P^ + J 
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. For the length \I ess (fc)| of the essential interval we get therefore 

2a (k) 



(2.9) 



\Iess (k)\ 



(k) 



71 1 

A 2 afcs 



oil 



. For a < 2 it follows that I ess (k) n I ess (k + 1) ^ for all fc sufficiently large. In 
contrast to this we have for a > 2 the situation that certain k-values do essentially 
not appear since |J ess (fc)| converges to zero as k tends to infinity and I ess (k) has to 
contain an integer to give a contribution. We estimate now the sum in the indegree 
distribution for large k by restricting the summation to the essential intervals: 



(2.10) 



Pr (d in (x) = k) ~ 

ieless(k) 



k\ 



Cl 

if 



. Using stirlings formula k\ = (2nk) 2 (|) fe (l + o(l)) and the approximation 
(1 + |) fe = e ^-§l+°(i) for | e | < fcf wc obtain: 



(2.11) 
(2.12) 



Pr (d in (x) = k) 



2a (k) k ±_i 1 + 0(1) 



a 



(27rfc) ! 



fc 1 +i-H+°( 1 ) 



. We get therefore an asymptotic power law exponent 7' = 1 + for the indegree 
distribution which is also the exponent of the total degree distribution in the case 
a > 1 since then 7' < 7 and hence the indegree dominates the outdegree. 

It is much more easy to estimate the integrated tail distribution. We will do 
this again for the indegree distribution and a > 2 to be able to compare with the 
situation for smaller values of a. Clearly one has 



(2.13) F k = Pr (d^ (x)>k)=Y,Y,Y, Pl (*« W = l \ x e C ^ 

i xeCi l>k 

Cl 

if 



(2.14) 



]T^Pr(d m (x)=l\x&C i )^ 



i l>k 



. Since for the degree I > k only vertices with indices i > fc= + 

contribute and in this case the sum ^2 l>k Pr (d in (x) = I \ x € Cj) is essentially 1, 
wc get 



(2.15) 
(2.16) 
(2.17) 



^Pr(d m (x)^|*e^) 
.[ fc i(l4.(*i))j'>* 

ECl 
if 

.[ k i(l+o(ki))\ 

Cl Cl 



Cl 

if 



^(l + oOf- 1 ^-i(l + c(^-i)) 



Therefore the integrated tail-degree distribution is still of power-law form. 
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Next we want to estimate the second moments M™ of the indegree distribution 
for values 1 < a < 7 — 1. By definition we have 



(2.18) 
(2.19) 



MX 



E fc2 Pr ( d ^ = k 1 x e G ^> 



xed 



^^fc 2 Pr(d(x) = fc|xeC,)^ 



and asymptotically since the second moment of a Poisson distribution with expec- 
tation A is given by A 2 — A : 



(2.20) 
(2.21) 



MX 



Ai a 



E 



/ const 



2a 



const\ 



. The first term in the sum diverges for a > which is of course also the 
condition for the total degree second moment to diverge (in the case a < 1 the 
outdegree dominates the indegree and the second moment diverges for 7 < 3). We 
show in the next paragraph that this is precisely the condition for absence of an 
epidemic threshold. 

2.2. The branching process approximation. In the following we want to an- 
alyze the branching process approximation for the epidemic process on the above 
described random graph space Q n (a, 7). Due to the absence of local cycles (in 
the large n limit) one gets an exact threshold estimation via the branching pro- 
cess approximation. Let n be the transmission probability that an infection will 
be transmitted along a given edge of the graphs. Eliminating edges in Q n (0,7) 
with probability 1 — fi gives a new random graph space G„ (a, 7) which we call the 
epidemic random graph. We will denote expectations with respect to Q% (a, 7) by 
E^ whereas E is reserved for expectations with respect to Q n (a, 7). 
Let T (a, 7, n) = (ay (a, n)) be given by 

(2.22) a l3 (a, n) := E M (# -[y : x ~ y} \ x & C % (n) ,y e C j (n)) 



(2.23) =ti-E(%{yx~y}\x£C i (n),yeC j (n)) 

and let (a) := linin^^ a^- (a,n) . Note that T (a, 7, n) is the transposed of the 
branching process transition matrix. In other words (a) is the expected number 
of infected individuals in group j generated by one infected individual in group i. 
Actually the exact terms in the asymptotic branching process matrix are given by 
dij (a) — 1. Since the difference by one is irrelevant for our considerations we neglect 
this term in the computations. From formula (|2.4(l we conclude that 

(2.24) Q iJ (a,n)^C 1 -/x- * ja ^ Jt ° 

where the constant C\ depends only on a and 7. Absence of an epidemic threshold 
for fixed a and 7 holds exactly if 

(2.25) lim A max (T (a, 7, n)) = 00 for u > 

n — >oo 

with A max (.) being the largest eigenvalue in modulus. 
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The regions in the parameter space for which we will get absence of an asymptotic 
epidemic threshold are 1) 2 < 7 < 3 and arbitrary a and 2) the parameters with 
7 > 3 and a > Finally we get for 7 > 3 and a < that there is a size 

independent threshold. 

Since the smallest row sum of a positive matrix gives a lower bound on the largest 
eigenvalue it is enough to show that for (bik (n)) := T 2 (a, 7, n) and a and 7 as in 
case 1) or 2) we have 

(2.26) min> hkM — ► 00. 

fe>l 

Note that ay (a, n) ~ ay (a) and therefore 

(2.27) 6 ;fc (n) ~ J] ok (a) a ik (a) 

h q + u a ik a + ki a 



(2.28) = (^r) 2 ^ 



, ^ , 2/ lk a + l a k ^i a+1 
(2-29) = (Ka) 2 (— — + 

i 

Zfc i 2a l a k a ^ i 2 N 
2.30 — + ^ — > — • 

. From this formula one immediately concludes the absence of an epidemic threshold 
in the above described domains since E 4^ respectively E ^ diverge for 7 < 3 

respectively a > whereas the divergence of E L w i s irrelevant since 
> 7 - 2 for 7 > 3. 

To show the existence of an asymptotic threshold for parameters satisfying 7 > 3 
and a < we will demonstrate that for /x > we have lim A max (T' m (a, 7, n)) < 

m— >oo 

co from which the assertion then follows. Let (gj^ (n)) :— T 171 (a, 7,n). We will 
derive a recursion for (n) as follows. Since 

(2.31) 5ifc (n) ~ (itCi) 2^ It -■ ^7 Ui 

l m ...ii 

we define variables 

(2.32) g lk (n,x,y) := (nd) ^ r? ~ Ui 

and observe that 

(2.33) g% +1 (n,x,y)=»C 1 -gr k (n,x',y') 
where 

(2.34) x' = B x + B iy ; y' = B 2 x + B a y 

and Bo = E ^ ^1 = E and B 2 = £ £• Since 5 ™ (n, fc a , fc) ~ g% (n) we 



only have to show that /zCi • A„ 



A So 




^ B 2 


B J 



< 1 to establish the boundedness 



of A max (T m (a, 7, n)). Clearly if B ,Bi and B 2 stay bounded as a function of n we 



8 



PH.BLANCHARD, C.-H. CHANG, AND T. KRUGER 



can match the requirement of the last sentence. But Bo, B\ and B2 stay bounded 
on the parameter set 7 > 3 and a < from which our assertion follows. 

The result has a nice heuristic explanation. The "tail" -groups (that is the groups 
d with i very large) act as a hub for the infection. Indeed given an infected 
individual in group Ck it spreads the infection into the hub via it's outdegree edges 
(for a large) from which the infection is backspread via the indegree edges of the 
hub to Ck or any other fixed group. It is precisely this infection path which is 
proportional to Yl tt an d therefore makes the branching process divergent for 

a > 27+I. 

3. The very small world inside the small world 

In the following we will show that the distinction between absence and pres- 
ence of an epidemic threshold can be exactly related to properties of the diam- 
eter of the giant component Q n (a,-f) of Q n (a,j). Of course the diameter of 
a random graph space is itself a random variable but it turns out that it has 
a small variation. For our purpose we therefore concentrate on the expected 
diameter. Given a graph G n with n vertices and a subset A of V n we define 
diam (G ni A) := max {d (x, y) \ x,y e A} where d (x, y) is the distance on the graph 
G n - Let the e— essential diameter of a graph G n be defined as diam% ss (Gn) '■= 
min {diam (G n , A) \ \A\ > (1 — s) \V n \}- Similar if we have a sequence of random 

graph spaces {Q n } the quantity diam% ss (Q n ) is now a random variable depending 
on the realization G n € Q n - 

The striking phenomenon in the case of absence of an epidemic threshold for the 
above described parameter region is the following property: 

For all e E (0, 1) there is a C (e) > such that Pr {diamf 8 (G n (a, 7)) < C (e)} 
— > 1 as n tends to infinity. 

In other words: A positive fraction of the vertices are located in a cluster of 
finite diameter. This cluster has a kind of fuzzy, onion like structure with no sharp 
boundary. At the "center" of the cluster are the very high degree groups placed 
which are entirely contained in the cluster. The union of shells which are at distance 
less than R from the center has positive mass as soon as R gets larger than a critical 
value Co- Increasing R makes this fraction larger and larger but to get the whole 
giant connected component in the cluster requires to take R — > 00 as n goes to 
infinity (something between log log n and logn should be the right growth rate). 

Before going into the details let us recall the general philosophy how to estimate 
the diameter of a (large) connected component of a random graph space. Fixing 
an individual x one tries to estimate the expectation of Tk := )J {y : d(x,y) = k} 
as good as possible. As long as J2 i<k Ti -^i n holds and the variation of the Tk is 
under control one can use in general a branching process approximation to get the 
right order of the expectation of Tk- The smallest ko such that ^2 i<ko ~ n gives 
then a very good estimation of diam e £ 8S (for e small). 

To see that the above statement is true let us note first that iterating the branch- 
ing process matrix fc— times -denoted in the following by (t\j^ and being the 

transposed of T k (a, 7) — gives the expected size of the number of individuals in 
group i which are at distance fc to a random chosen individual in group j pro- 

(k') 

vided that all expectations XXj are mucn smaller than \d (n)\ for fc' < fc. 

3 
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Assume now that the maximal outdegree respectively the largest group index in 
our model scales like n* 3 for some 0< (3 < ^ and hence the size of the group 
with the maximal outdegree is of order const ■ n 1 "^ 7 . Since the dominating con- 
tribution in the terms comes from I I we can safely iterate the 

branching process matrix fc— times as long as fc/3(l — 7 + 2a) < 1 — /3-y (note 
that in the "no-threshold" case we always have 1 — 7 + 2a > 0). Let fc be the 
largest k for which the above inequality holds. To get a good lower bound on 
the number of individuals at distances larger then fc one has to truncate the ma- 
trix properly in such a way that only paths are taken into account which make 
no use of groups which are smaller than the entries in the iterated branching pro- 
cess matrix. Therefore we consider in step ko + k only those groups Cj for which 
i < rJ i] * with f3k ■= y+2a )(ka+k)+-y ■ ^he leading term after k + k iterations 

is now given by n fco/3(l-7+2a) JT n ft(l-7+2«) = n *=o/3(l-7+2a)+Ei<fc ( i_ 7 + 2 ~^+*)+-r . 

l<k 

Since the sum in the exponent diverges there is a size independent k\ such that 
n fco/3(i-7+2a) Y[ tjA( 1 -t+ 2q ) ~ n and consequently a finite fraction of the vertex 

Z<fci 

set stays within a bounded distance. 

Let us finally demonstrate that there cannot be a C* such that essentially all 
vertices of the giant component arc within a distance less than C*. Fix / > 
and an individual x with d(x) < f. From formula (2.4.) it follows that there is 
a n independent p (/) > such that the probability that x has only partners y 
with d (y) < f is larger p (/) > 0. Therefore the probability that at distance less 
equal C* from x there are only individuals with degree less than / is larger than 

fC*+l 

p (/) . Since there are const (/) • n individuals with degree less than / there 
is a small but positive fraction of individuals which are at distance larger than C* 
from the high degree groups for arbitrary large C* . In a similar way one can show 
that there are individuals with distance of order In In n (but those form only a set 
of asymptotic zero density). 



4. A TOY EXAMPLE 

In the following section we want to discuss a model in which the degree-distribution 
is somehow extreme, namely there are only two types of individuals with sizes C 2 
and C/( n ) . The index 2 and / (n) stands for the degree of the two individual- types 
the last one depends on the size n of the vertex set. We will make the general 
requirement that the second moment of the degree distribution diverges whereas 
the first moment converges: 

I C I 

(4.1) Urn sup /(n) 1 /(n)l < 00 



(4.2) [/(„)]» oo. 

We will look at the epidemic process initiated by one randomly infected individ- 
ual in group C/( n ) an d ask wether one has a persistent epidemics for arbitrary small 
transmission probability. For this model the question of existence of an epidemic 
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threshold is only interesting in the above conditional sense since due to our assump- 
tions the asymptotic fraction of C/(n) individuals on the total population tends to 
zero and no epidemics can be persistent among the Ci population for sufficiently 
small transmission probabilities. 

We will investigate two different structures of edge formations. 

4.1. Case A. In this subsection we want to separate the elements of Cfi n ) as far 
as possible from each other which gives the epidemiologically "safest" structure in 
the sense that unless the first moments of the degree-distribution " almost" diverge 
one would expect an undercritical situation if the transmission probability is suffi- 
ciently small (note that if the first moment of the degree-distribution diverges one 
gets trivially no threshold). We want to separate all elements of C/(„) by chains of 

\Ct I 

uniform length / (n) consisting of elements of C2 . Since we have / (n) 1 / 2 < " ) 1 chains 
(we drop all effects due to integer part requirements since they are asymptotically 
irrelevant) we get for the length I (n) = — 1 ^ ^ + 1. The probability to transmit 

an infection along such a chain is /i'*™) and hence the expected number of indi- 
viduals in Cj( n ) which will be infected from one initially infected individual from 
group Cf(n) equals / (n) fJ^V. To get absence of a threshold we need therefore 
/ (n) zz^™) > 1 asymptotically. This is equivalent to log/(n) — c (" l^/wl) > q 

|C/(„)|/(n) 

with c = 21ogi from which by l|4.1|l O (n) log / (n) > c • (n — |C/( n )|) follows. 
First we observe that it is impossible to find exponents (3, 5 £ (0, 1) s.t. / (n) := n@ 
and |C/(„)| := n 5 and / (ri) jj, l ( n ' — + 00 under the assumption f) + a < 1. Note 
that (3 + a = 1 is just the borderline where the expectation of the degree starts to 
become large. 

As a conclusion from the considerations in this section one should probably take, 

that it is the balance between separation length and degree growth which matters 

for the epidemiological threshold-question. So if the elements from C2 would not 

be as uniformly spread we should look at the following quantities: ipi [/ (n)] := 

E x [tt (y £ Cf(n) '■ dist (x,y) = /)] where disto denotes the length of the shortest 

path between x £ Cjr n ) an d V which entirely consists of individuals from C2. For 

getting an overcritical epidemics we therefore need A (n) := ^ tpi [/ (n)] /j, 1 > 1. 

;>o 

4.2. Case B. Assume now that we have stationary asymptotic probabilities poo, 
Poii P10 an d Pu for the events that a single random chosen edge from an x £ C^(„) 
points to an element from Cfi n ) (poo) or to an element from Ci (poi) respectively 
that an edge from an element from C2 points to C^(„) (P10) or to an element from 
C2 (pn)- There are three conditions on the choice of the pij namely poo + P01 = 
liPio + P11 = 1 an d P01 ' / (n) |C/(n) I = P10 ' 2 \C%\. Since the first moments are 
bounded we have |C/(„)| = o (n) and therefore \C-i\ ~ n. To get the threshold 
condition we have to estimate the expected number of secondary infections caused 
by an infected individual from C/(„) which is precisely given by the quantity A (n). 
To compute the numbers (pi [/ (n)] observe that 

(4.3) Pr (a fixed chain starting at x has length I, I > 1) = poip^pw 
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and Pr (a fixed chain starting at x has length 1) = poo- Therefore we have 

(4.4) W If ' M 1 = / (») ( Woo + J2 ^PoiPn Vo J 
z>i \ ;>i / 

i 

(4.5) = / (n) woo + f (n) fi P01P10 



1 - MPn 

(we dropped the n-dependence of the Pij). Clearly the direct coupling between 
the Cf(n) individuals has to be of order at most j^y to avoid a nontrivial epi- 
demics inside the C/(n) population which implies f (n) |C/( n )| *C C2 and in turn 
pio -* 0. Using the relations between the p^ we have A > fi 2 ■ /a( a|^j w| - From 

the divergence of the second moments one gets - — > 00 and hence ab- 
sence of an epidemic threshold in this probabilistic setting. It remains to estimate 
the asymptotic number / (n) of infected individuals which equals the size of the 
largest component of the epidemic random graph. We first estimate the num- 
ber of infected individuals in C/( n ) denoted by If via the stationarity equation 

l^/(")l (l — A(n) ) = If ( n )' Furthermore an infected individual from Cy(„) in- 
fects in mean X);>i / ( n ) ' A* = / ( n ) 1^77 individuals from C2 from which one 
obtains the following asymptotic bounds on the number of infected individuals: 



\ \ C f( 



l-Ak)f(n)j^<I(n)< \C f(n) I (l - zfo) / (n) ^ Therefore 

unless one has lim n _ >00 ^ C ' /( |^^ ( '"' > > (in this case one gets trivially absence of a 
threshold since poo > 0) the size of the asymptotic fraction of infected individuals 
in the case of absence of a threshold still converges to zero. 



5. Conclusions and comments 

We have shown that for the preferential choice model described in this paper 
the divergence of the second moment is precisely the condition for absence of an 
epidemic threshold even in the case when the asymptotic distribution is not of a 
power-law form (but has gaps). In terms of graph geometry this is related to the 
presence or absence of a massive finite diameter cluster which carries the epidemics. 
It seems that for the case of an exponent larger than three one always has a 
size independent threshold if one additionally requires that the size of the largest 
component of the epidemic random graph is proportional to fj, ■ n. Note that it is 
easy to get absence of threshold for exponents larger than three if one introduces 
horizontal preferences in the partner choice (for instance a strong bias to make 
connections into the own group) but in this case the size of the largest component 
scales like fi 1+e ■ n with an e > 0. It is a much more difficult task to estimate 
exactly the size of the giant component on the epidemic random graph which will 
be the subject of a more mathematical forthcoming paper j^j. Certainly it scales 
like const ■ u ■ n. It would be furthermore very interesting to have random network 
growth principles which produce in a natural way the scale free graphs we described 
in this paper. For some results in this direction see JUj- In any case the model 
seems to be interesting in describing computer networks and associated spread of 
viruses on it. Caution is required if one wants to draw conclusions for sexually 
transmitted diseases like HIV on social networks even when these networks have 
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a power law tail distribution for the degree. The main point here is that the 
transmission probability scales with the degree for small transmission probabilities- 
like in the AIDS-epidemics- since high degree individuals spend necessarily less 
time with each single partner 

Finally we would like to mention that there is a natural variant of our model 
by defining independent directed edge probabilities between x S C, and y 6 Cj 
as Pij = with the same A a as above. For this model one gets the same 

threshold statements since it has the same branching process transition matrix. 
For some alternative strategies to choose partners and the corresponding threshold 
estimations see also|ll||5). 
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